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TYROSINE KINASE 
BACKGROUND OF THE INVENTION 

1. Field of The Invention 

This invention relates generally to the molecular cloning of genes which can 
5 be used in toxicity assays and, specifically, to the isolation of a mammalian 

DNA recombination and repair gene which can be used in an assay to 
screen various compositions which affect DNA repair. 

2. Related Art 

Chromosomes experience single-stranded or double-stranded breaks as a 
10 result of energy-rich radiation, chemical agents, as well as spontaneous 

breaks occurring during replication among others. Although genes present 
in the chromosomes undergo continuous damage, repair, exchange, 
transposition, and splicing, certain enzymes protect or restore the specific 
base sequences of the chromosome. 

15 The repair of DNA damage is a complex process that involves the 

coordination of a large number of gene products. This complexity is in part 
dependent upon both the form of DNA damage and cell cycle progression. 
For example, in response to ultraviolet (UV) irradiation, cells can employ 
photoreactivation or excision repair functions to correct genetic lesions. The 

20 repair of strand breaks, such as those created by X-rays, can proceed 

through recombinational mechanisms. For many forms of DNA damage, the 
cell is induced to arrest in the G2 phase of the cell cycle. During this G2 
arrest, lesions are repaired to ensure chromosomal integrity prior to mitotic 
segregation. 
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Since the transfer of genetic information from generation to generation is 
dependent on the integrity of DNA, it is important to identify those gene 
products which affect or regulate genetic recombination and repair. 
Through the use of organisms with specific genetic mutations, the normal 
5 functional gene can be obtained, molecularly cloned, and the gene products 

studied. 

Phenotypic complementation, as a way of identifying homologous normal 
functional genes, is widely used. For example, the human homologue of the 
yeast cell cycle control gene, cdc 2, was cloned by expressing a human 
cDNA library in Schizosaccharomyces pombe and selecting those clones 
which could complement a mutation in the yeast cdc 2 gene (Lee, et a/., 
Nature, 22Z:31, 1987). A mammalian gene capable of reverting the heat 
shock sensitivity of the RAS2 vaJ19 gene of yeast, has also been cloned by 
using complementation (Colicelli, et aL, Proc.Nat'lAcad.Sci. USA, 3§:3599, 
1989). A rat brain cDNA library was used to clone a mammalian cDNA that 
can complement the loss of growth control associated with the activated 
RAS2 gene in yeast The gene, DPD (dunce-like phosphodiesterase), 
encodes a high-affinity CAMP phosphodiesterase. 

In eukaryotes such as Saccharomyces cerevisiae, genetic studies have 
20 defined repair-deficient mutants which have allowed the identification of more 

than 30 radiation-sensitive {RAD) mutants (Haynes, et aL, in Molecular 
Biology of the Yeast Saccharomyces, pp. 371, 1981; J. Game in Yeast 
Genetics: Fundamental and Applied Aspects, pp. 109, 1983). These 
mutants can be grouped into three classes depending upon their 
25 sensitivities. These classes broadly define excision-repair, error-prone repair, 

and recombinational-repair functions. The molecular characterization of 
yeast RAD genes has increased the understanding of the enzymatic 
machinery involved in excision repair, as well as the arrest of cell division by 
DNA damage. 
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3. 



The understanding of RAD genes and their expression products has 
become increasingly important as research continues to develop more 
effective therapeutic compositions. Often these new compositions appear 
quite effective against a particular disease condition, such as certain tumors, 
5 but prove to be too toxic for in vivo therapy in an animal having the disease. 

Indeed, these compositions can actually increase the likelihood of 
mutagenesis. 

Most agents that are mutagenic or carcinogenic are in themselves 
unreactive, but are broken down to reactive intermediates in vivo. It is these 
reactive intermediates which interact with DNA to produce a mutation. This 
event is thought to be the initial step in chemical carcinogenesis. Mutations 
in a large number of genes affect the cellular response to agents that 
damage DNA In all likelihood, many of these mutated genes encode 
enzymes that participate in DNA repair systems. Consequently, when the 
repair system is compromised, the cells become extremely sensitive to toxic 
agents. Although the DNA may revert to normal when DNA repair 
mechanisms operate successfully, the failure of such mechanisms can result 
in a transformed tumor cell which continues to proliferate. 

20 Although there are currently available tests to determine the toxicity or 

mutagenicity of chemical agents and compositions, there are limitations in 
both laboratory screening procedures and animal toxicity tests. These 
limitations include extrapolating laboratory data from animals to humans. 
There is often a large measure of uncertainty when attempting to correlate 

25 the results obtained in laboratory animals with effects in human subjects. 

In most cases, doses of the test drug have been used in the animal which 
are too high to be safely administered to humans. In addition, some types 
of toxicity can be detected if the drug is administered in a particular species, 
yet may be missed if the experiment is not done in the correct animal 
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species. Moreover, many currently available laboratory tests are incapable 
of detecting certain types of toxic manifestations which occur in man. 

Drugs are also routinely tested for their mutagenic potential using 
microorganisms in the screening assay. The popular test developed by 
5 Ames and colleagues (Ames, er a/., Mutat. Res.:21, 347, 1975) uses 

Salmonella typhimurium containing a mutant gene for histidine synthesis. 
This bacterial strain cannot grow in a histidine deficient medium unless a 
reverse mutation is induced by exposure to a particular agent. TTie Ames 
test is rapid and sensitive, however, its usefulness in predicting carcinogenic 
10 or mutagenic potential of chemical substances in human is unclear. 

In summary, limitations and uncertainties of existing laboratory tests fail to 
provide an accurate method of examining the effects of a composition on 
DNA integrity. In view of this, a considerable need exists for screening 
methodologies which are inexpensive, rapid, and contain the relevant gene 
15 from the animal which is to be treated with the composition. Such methods 

provide a direct assay to determine if a composition interferes with the DNA 
repair system of a cell. 
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SUMMARY OF THE INVENTION 

The present invention arose from the discovery of a novel protein which is 
involved in repair of DNA strand breaks. Although this protein has kinase 
activity, it is the only kinase known to promote repair of DNA strand breaks 
5 occurring at a specific nucleotide sequence and allow normal mitotic 

recombination. The identification of the normal, or "wild-type", protein kinase 
was made possible by the isolation of a yeast mutant (hrr25) defective in 
repairing DNA strand breaks, but still capable of promoting normal mitotic 
recombination. The wild-type gene {HRR25) was isolated by screening a 
1 0 DNA library for nucleotide sequences which could restore the ability to repair 

DNA breaks. 



A major advantage of the present invention is that it now enables 
identification of functionally analogous wild-type proteins from other species, 
especially humans. The identification of such foreign protein provides the 
15 further advantage of allowing their use in a screening method designed to 

examine the effect of various compositions on the DNA break repair 
promoting activity of the foreign protein. 
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6. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 (A) shows the nucleotide and amino acid sequence of the HRR25 
gene. The locations of the prolines and glutamines at the C-terminus are 
indicated by asterisks and the limits of homology to the protein kinase 
5 catalytic domain are shown by arrows. (B) shows the protein kinase 

homology represented by a shaded region while the P/Q rich region is 
indicated by cross hatching. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention relates to a DNA recombination and repair gene which 
can be used in an assay system to examine the effects of various 
5 compositions on DNA integrity. The invention also provides a DNA 

sequence encoding a polypeptide which promotes normal mitotic 
recombination, but is defective in tyrosine kinase activity and essentially 
unable to repair DNA strand breaks. This defective DNA sequence is highly 
useful for identifying other DNA sequences which encode proteins with 

10 functional tyrosine kinase activity. These functional sequences, which can 

be characterized by their ability to restore DNA strand breaks, permit the 
screening of compositions to determine whether a particular composition 
has an effect on the restoration of such repair activity. In addition, the 
present invention relates to the polypeptide encoded by the defective DNA 

15 sequence, as well as the polypeptide encoded by the functional wild-type 

DNA. 

In order to identify a DNA sequence encoding a polypeptide with tyrosine 
kinase activity, a method is provided whereby a DNA library is screened for 
nucleotide sequences capable of restoring DNA strand break repair in a 
20 mutant lacking such activity. A method is further provided for identifying a 

composition which affects the activity of a mammalian polypeptide having 
tyrosine kinase activity, wherein the polypeptide is capable of restoring DNA 
double-strand break repair activity in a mutant lacking such activity. 
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In general, the defective protein kinase can be characterized by its ability to 
promote normal mitotic recombination, while being essentially unable to 
repair DNA double-strand break including that which occurs at the cleavage 
site: 

5 CAACAG 

GTTGTC 

t 

The DNA double-strand breaks which the defective protein kinase is 
10 essentially unable to repair can be induced by various means, including 

endonucieases, x-rays, or radiomimetic agents including alkylating agents. 
Preferred endonucieases are those which recognize the same nucleotide 
cleavage site as endonuclease HO. Radiomimetic alkylating agents having 
methylmethane sulfonate activity are preferred. Those of skill in the art will 
15 be able to identify other agents which induce the appropriate DNA strand 

breaks without undue experimentation. 

The present invention specifically discloses mutants sensitive to continuous 
expression of the DNA double-strand endonuclease HO, which codes for a 
65 kDa site-specific endonuclease that initiates mating type interconversion 

20 (Kostriken, et aL, Cold Spring Harbor Symp.QuantBioL , 49:89, 1 984) . These 

mutants are important to understanding the functions involved in recognizing 
and repairing damaged chromosomes. This invention also discloses a yeast 
wild-type DNA recombination and repair gene called HRR25 (HO and/or 
radiation repair). Homozygous mutant strains, hrr25-1, are sensitive to 

25 methylmethane sulfonate and X-rays, but not UV irradiation. The wild-type 

gene encodes a novel protein kinase, homologous to other serine/threonine 
kinases, which appears critical in activation of DNA repair functions by 
phosphorylation. 
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The HRR25 kinase is important for normal cell growth, nuclear segregation. 
DNA repair and meiosis, and deletion of HRR25 results in cell cycle defects. 
These phenotypes, coupled wfth the sequence similarities between the 
HRR25 kinase and the Raflc-mos protein kinase subgroup suggest that 
5 HRR25 might play a similar role in S. cerevisiae growth and development. 

The defects in DNA strand break repair and the aberrant growth properties 
revealed by mutations in HRR25 kinase, expands the role that protein 
kinases may play and places HRR25 in a functional category of proteins 
associated with DNA metabolism. 

10 The development of specific DNA sequences encoding protein kinase 

polypeptides of the invention can be accomplished using a variety of 
techniques. For example, methods which can be employed include (1) 
isolation of a double-stranded DNA sequence from the genomic DNA of the 
eukaryote; (2) chemical synthesis of a DNA sequence to provide the 

15 necessary codons for the polypeptide of interest; and (3) in vitro synthesis 

of a double stranded DNA sequence by reverse transcription of mRNA 
isolated from a eukaryotic donor cell. In the latter case, a double stranded 
DNA complement of mRNA is eventually formed which is generally referred 
to as cDNA. 

20 The novel DNA sequences of the invention include ail sequences useful in 

providing for expression in prokaryotic or eukaryotic host cells of 
polypeptides which exhibit the functional characteristics of the novel protein 
kinase of the invention. These DNA sequences comprise: (a) the DNA 
sequences as set forth in Figure 1 or their complementary strands; (b) DNA 

25 sequences which encode an amino acid sequence with at least about 35% 

homology in the protein kinase domain with the amino acid sequences 
encoded by the DNA sequences defined in (a) or fragments thereof; and (c) 
DNA sequences defined in (a) and (b) above. Specifically embraced in (b) 
are genomic DNA sequences which encode allelic variant forms. Part (c) 
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specifically embraces the manufacture of DNA sequences which encode 
fragments of the protein kinase and analogs of the protein kinase wherein 
the DNA sequences thereof may incorporate codons which facilitate 
translation of mRNA. Also included in part (c) are DNA sequences which 
are degenerate as a result of the genetic code. 

Since the DNA sequence of the invention encodes essentially the entire 
protein kinase molecule, it is now a routine matter to prepare, subclone, and 
express smaller polypeptide fragments of DNA from this or a corresponding 
DNA sequence. The term "polypeptide" denotes any sequence of amino 
acids having the characteristic activity of the mutant or wild-type protein 
kinase of the invention, wherein the sequence of amino acids is encoded by 
all or part of the DNA sequences of the invention. 

The polypeptide resulting from expression of the DNA sequence of the 
invention can be further characterized as being free from association with 
other eukaryotic polypeptides or other contaminants which might otherwise 
be associated with the protein kinase in its natural cellular environment. 

Isolation and purification of microbially expressed polypeptides provided by 
the invention may be by conventional means including, preparative 
chromatographic separations and immunological separations involving 
monoclonal and/or polyclonal antibody preparation. 

In general, expression vectors useful in the present invention contain a 
promoter sequence which facilitates the efficient transcription of the inserted 
eukaryotic genetic sequence. The expression vector typically contains an 
origin of replication, a promoter, and a terminator, as well as specific genes 
which are capable of providing phenotypic selection of the transformed cells. 
The transformed hosts can be grown in fermentors and cultured according 
to techniques known in the art to achieve optimal cell growth. The 
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polypeptides of the invention can then be isolated from the growth medium, 
cellular lysates, or cellular membrane fractions. 

The DNA sequences of the present invention can be expressed in vivo in 
either prokaryotes or eukaryotes. Methods of expressing DNA sequences 
5 containing eukaryotic coding sequences in prokaryotes are well known in 

the art. Biologically functional viral and plasmid DNA vectors used to 
incorporate DNA sequences of the invention, for expression and replication 
in the host cell are well known in the art. For example, DNA can be inserted 
in yeast using appropriate vectors and introducing the product into the host 

10 cells. Various shuttle vectors for the expression of foreign genes in yeast 

have been reported (Heinemann, etal., Nature, 34Q:205, 1989; Rose, et ai, 
Gene, 6Q:237, 1987). Those of skill in the art will know of appropriate 
techniques for obtaining gene expression in both prokaryotes and 
eukaryotes, or can readily ascertain such techniques, without undue 

15 experimentation. 

Hosts include microbial, yeast and mammalian host organisms. Thus, the 
term "host" is meant to include not only prokaryotes, but also such 
eukaryotes such as yeast, filamentous fungi, as well as plant and animal 
cells which can replicate and express an intron-free DNA sequence of the 
20 invention. The term also includes any progeny of the subject cell. It is 

understood that not all progeny are identical to the parental cell since there 
may be mutations that occur at replication. However, such progeny are 
included when the terms above are used. 

Transformation with recombinant DNA may be carried out by conventional 
25 techniques well known to those skilled in the art. Where the host is 

prpkaryotic, such as E. coli, competent cells which are capable of DNA 
uptake can be prepared from cells harvested after exponential growth phase 
and subsequently treated by the CaCI 2 method using procedures well 
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known in the art. Alternatively, MgCI 2 or RbCI could be used in the reaction. 
Transformation can also be performed after forming a protoplast of the host 
cell. 

Where the host is a eukaryote, various methods of DNA transfer can be 
5 used. These include transfection of DNA by calcium phosphate-precipitates. 

conventional mechanical procedures such as microinjection, insertion of a 
plasmid encased in liposomes, spheroplast electroporation, salt mediated 
transformation of unicellular organisms or the use of virus vectors. 

Analysis of eukaryotic DNA has been greatly simplified since eukaryotic DNA 

10 can be cloned in prokaryotes using vectors well known in the art. Such 

cloned sequences can be obtained easily in large amounts and can be 
altered in vivo by bacterial genetic techniques and in vitro by specific 
enzyme modifications. To determine the effects of these experimentally 
induced changes on the function and expression of eukaryotic genes, the 

1 5 rearranged sequences must be taken out of the bacteria in which they were 

cloned and reintroduced into a eukaryotic organism. Since there are still 
many functions in eukaryotic cells which are absent in prokaryotes, (e.g., 
localization of ATP-generating systems to mitochondria, association of DNA 
with histones, mitosis and meiosis, and differentiation of cells), the genetic 

20 control of such functions must be assessed in a eukaryotic environment. 

Cloning genes from other eukaryotes in yeast has been useful for analyzing 
the cloned eukaryotic genes as well as other yeast genes. A number of 
different yeast vectors have been constructed for this purpose. All vectors 
replicate in E. coli, which is important for amplification of the vector DNA. 

25 All vectors contain markers, e.g., LEU 2, HIS 3. URA 3. that can be selected 

easily in yeast. In addition, these vectors also carry antibiotic resistance 
markers for use in E. coli. 
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Many strategies for cloning human homoiogues of known yeast genes are 
known in the art. These include, but are not limited to: 1) low stringency 
hybridization to detect shared nucleotide sequences; .2) antibody screening 
of expression libraries to detect shared structural features; and 3) 
5 complementation of mutants to detect genes with similar functions. 

For purposes of the present invention, protein kinases which are 
homologous can be identified by structural as well as functional similarity. 
Structural similarity can be determined, for example, by assessing amino 
acid homology or by screening with antibody, especially a monoclonal 
10 antibody, which recognizes a unique epitope present on the protein kinases 

of the invention. When amino acid homology is used as criteria to establish 
structural similarity, those amino acid sequences which have homology of 
at least about 35% in the protein kinase domain are considered to be 
essentially the same as the amino acid sequences of the invention. 

When homologous amino acid sequences are evaluated based on functional 
characteristics, then a homologous amino acid sequence is considered 
equivalent to the amino acid sequence of the invention when the 
homologous sequence is essentially unable to repair (in the case of the 
repair defective mutant gene) or able to repair (in the case of the natural 
gene), DNA double-strand breaks, including that which occurs at a 
nucleotide cleavage site 

CAACAG 

GTTGTC 

t 

and when the homologous amino acid sequence allows normal mitotic 
recombination. 

This invention preferably uses the functional screening method whereby 
genes are cloned from plasmid libraries by complementation of a recessive 
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marker. A recipient strain such as Saccharomyces cerevisiae is constructed 
that carries a recessive mutation in the gene of interest. This strain is then 
transformed with a piasmid, for example, pYES2 (Invitrogen, San Diego. CA) 
containing the wild-type genomic DNA or cDNA. The clone carrying the 
5 gene of interest can then be selected by replica plating to a medium that 

distinguishes mutant from wild-type phenotypes for the gene of interest. 
The piasmid can then be extracted from the clone and the DNA studied. 
Several yeast vectors allow the application of complementation systems to 
go beyond isolation of yeast genes. Genes from a wide variety of species 
1 o can be isolated using these vectors. In such systems, DNA sequences from 

any source are cloned into a vector and can be screened directly in yeast 
for activities that will complement specific yeast mutations. 

In a preferred embodiment, the present invention uses a mutation in yeast, 
the hrr25 mutation, which was identified by sensitivity to DNA double-strand 
15 breaks induced by the HO endonuclease. The genomic DNA which 

complements this mutation was isolated by transforming the hrr25 strain with 
a DNA library and subsequently screening for methylmethane sulfonate 
(MMS) resistance. Alternately, functional genes from a variety of mammalian 
species can now be cloned using the system described. 

20 Yeast genes can be cloned by a variety of techniques, including use of 

purified RNA as hybridization probes, differential hybridization of regulated 
RNA transcripts, antibody screening, transposon mutagenesis, cross 
suppression of mutant phenotypes, cross hybridization with heterologous 
cDNA or oligonucleotide probes, as well as by complementation in E. coli. 

25 Minor modifications of the primary amino acid sequence may result in 

proteins which have substantially equivalent or enhanced activity as 
compared to the sequence set forth in Figure 1 . The modifications may be 
deliberate, as by site-directed mutagenesis, or may be spontaneous by 
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HRR25 producing organisms. All of these modifications are included in the 
invention as long as HRR25 activity is retained. Substitution of an aspartic 
acid residue for a glycine acid residue at position 151 in the sequence 
shown in FIGURE 1 identifies the mutant hrr25. 

5 Antibodies provided by the present invention are immunoreactive with the 

mutant polypeptides and/or the naturally occurring protein kinase. Antibody 
which consist essentially of numerous monoclonal antibodies with different 
epitopic specificities, as well as distinct monoclonal antibody preparations 
are provided. Monoclonal antibody is made from antigen containing 
10 fragments of the polypeptide by methods well known in the art (Kohler,G. 

ef a/., Nature 256:495, 1975; Current Protocols in Molecular Biology, 
Ausubel, F. ef a/ M ed.,1989). 

The invention also discloses a method for identifying a composition which 
affects the activity of a polypeptide having tyrosine kinase activity. The 

15 polypeptide is capable of restoring DNA double-strand break repair activity 

in host cells containing the hrr25 gene. The composition and the 
polypeptide are incubated in combination with host cells for a period of time 
and under conditions sufficient to allow the components to interact, then 
subsequently monitoring the change in tyrosine kinase activity, for example, 

20 by decreased repair of DNA double-strand breaks. The DNA strand breaks 

are induced, for example, by a radiomimetic agent, such as methylmethane 
sulfonate, x-rays, or by endonuclease like HO. Other means of inducing 
double-strand breaks that are well known in the art may be employed as 
well. 

25 The following examples are intended to illustrate but not limit the invention. 

While they are typical of those that might be used, other procedures known 
to those skilled in the art may alternatively be used. 
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EXAMPLE 1 

ISOLATION OF hrr25 

S. cerevisiae strain K264-5B (MATa ho um3 can1 R tyr1 his7 Iys2 ade5 met13 
trp5 Ieu1 adeS) was employed for the mutant isolation. The yeast were 
5 transformed according to standard procedures with a URA3-based 

integrating plasmid that contained a GAL1, 70-regulated HO endonuclease 
and a transformant was mutagenized to approximately 50% survival with 
ethyl methanesulfonate (EMS), as described (Current Protocols in Molecular 
Biology, supra). The culture was spread onto glycerol-containing rich 

1 0 medium (YPG, to avoid petites), colonies were allowed to form at 30 °C, and 

plates were replicated to glucose (HO repressing) and galactose (HO 
inducing) media. Mutants were identified by their inability to grow on 
galactose. Approximately 200 mutants were chosen for initial 
characterization and 62 maintained the gal- phenotype through repeated 

15 single colony purification. Among these, many were not complemented by 

various gal mutants. The remainder (25 mutants) were surveyed for 
overlapping DNA repair defects by determining sensitivity to ultraviolet (UV) 
irradiation and to methyl methane sulfonate (MMS). This screening method 
identified five alleles of known rad mutations and one new mutation. This 

20 new mutation hrr25-1 (HO and/or radiation repair), presented severe defects 

and was studied further. 

A recessive DNA repair defect is conferred by hrr25-1 that includes 
sensitivity to MMS. Hrr25-1 strains also show sensitivity at 5-20 Krad X- 
irradiation similar to that observed with mutations in the radiation repair 
25 genes RAD50 and RAD52 (Cole, ef a/., MoLCeU.Bioi, 2:3101. 1989). The 

hrr25-1 strains are no more sensitive to UV irradiation than wild type and are 
not temperature sensitive for growth at 37° C. Unlike hypo- and hyper-rec 
rad mutants which have several of the hrr25-1 phenotypes, hrr25-l strains 
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undergo normal mitotic recombination (Cole, et al., Mol.Cell.BioL, 9:3101, 
1989). Spontaneous gene conversion and crossing-over were the same for 
homozygous hrr25-1 and wild type strains. However, HRR25 is required for 
the correct completion of meiosis. The hrr25-1 homozygotes showed less 
5 than 1 % spores (tetranucleate cells) under conditions that produced 75-80% 

spores in an isogenic wild type strain. The hrr25-1 mutation could be 
complemented by a number of radiation sensitive mutations (rad6, 50, 52, 
54, and 57) that present some of the hrr25 phenotypes, suggesting that 
hrr25-1 is a newly uncovered rad-like mutation and not one of these 
10 previously described genes. These results also indicate that HRR25 plays 

a role in DNA repair and meiosis, but is not specifically required for the 
repair of spontaneous mitotic lesions by recombination. 



EXAMPLE 2 
ISOLATION OF HRR25 

1 5 The HRR25 gene was obtained by complementing for MMS sensitivity using 

a yeast genomic library constructed in the plasmid YCp50 (Rose, et al., 
Gene, 60:237, 1987). An hrr2S-1 strain, MHML 3-36d (ura3 hrr25), was 
transformed by standard methods (Nickoloff, ef al., J.Mol.Biol., 207:527, 
1989) to uracil prototrophy, transformants were amplified on media without 

20 uracil and replicated to media containing 0.01% MMS. Among 1200 

transformants, a single MMS resistant isolate was identified. 
Complementation for MMS sensitivity was found to segregate with the 
plasmid as determined by methods known in the art. 



A 12 kb genomic fragment was identified and complementing activity was 
25 localized to a 3.1 kb BamHI-Sa/I fragment by transposon mutagenesis and 

subcloning. This region complemented DNA repair defects as well as 
meiotic deficiencies. Gene targeting experiments linked this cloned region 
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to hrr25-1. Transposon insertion mutations within the SamHI-Sa/l fragment 
replaced into the cognate HRR25 genomic locus did not complement hrr25-1 
for MMS sensitivity, whereas adjacent chromosomal insertions outside the 
complementing region segregated in repulsion when crossed against 
5 hn25-1. 

Mini-TnrOLt/Ktransposons (Huisman, era/., Genetics, 116_:191, 1987) were 
used to delineate the approximate location of HRR25 on the 12 kb 
BamH\Sal\ fragment. Insertions located to the left hand 9 kb (of the 12 kb 
genomic fragment) did not inactivate complementation of hrr25-l MMS 

10 resistance compared with the un-mutagenized plasmid. Two insertions, 

located near an EcoRV site in the right hand 2 kb inactivated 
complementation. HRR25 complementation activity was localized to a 3.4 
kb Sail fragment. Approximately 300 bp of this fragment (right hand side 
of the 12 kb) were part of the pBR322 tetracycline resistance gene (between 

1 5 the BamHl site of pBR322-based YCpSO). The HRR25 open reading frame 

spans an internal region across an EcoRV site and two Bgl\\ sites within the 
right terminal 3 kb. 

The DNA sequence of the 3.1 kb fragment revealed a centrally located open 
reading frame of 1482 nucleotide. A transposon insertion mutation in this 

20 open reading frame inactivated HRR25 complementation whereas insertions 

elsewhere in the 12 kb clone did not affect HRR25 complementation. 
Transposon-mediated disruption of HRR25 also revealed several phenotypes 
not seen with hrr25-1. As expected, a TntO-based LUK transposon insertion 
(Huisman, er at., Genetics, 11&191, 1987) into the middle of plasmid-borne 

25 HRR25 coding region inactivated complementation for MMS sensitivity. 

Transplacement of this insertion into the genomic HRR25 gene revealed a 
severe growth defect in addition to MMS sensitivity and meiotic inviability. 
This severe growth defect was not observed with hrr25-1 strains. Wild type 
HRR25 strains doubled in rich media at 30' C every 80-90 minutes whereas 
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isogenic hrr25::LUK strains and hn25t doubled every 9-12 hours. hrr25-1 
had a doubling time of 2-4 hours. 

To determine whether the mutant phenotypes revealed by the hrn:LUK 
disruption allele represent a null phenotype, the entire HRR25 coding 
5 sequence was deleted. Briefly, deletion of the HRR25 coding sequence 

employed a hisG::URA3::hisG cassette (Alani, ef ai, Genetics, Hf:541, 
1988). The 3.1 kb HRR25 Sail fragment was cloned into pBluescript 
(Stratagene, La Jolla, CA). This plasmid was digested with Bg/II and the two 
flg/ll fragments that span the entire HRR25 gene and its flanking sequences 

10 were deleted. Into this deletion was introduced the 3.8kb BamHI-Bg/ll 

hisG::URA3::hi$G fragment from pNKY51 to create the hn25t allele. Sail 
digestion yielded a linearized fragment that deleted the entire HRR25 locus. 
Yeast carrying the deletion-disruption allele (ftrr25A) showed phenotypes 
identical to those with the hrr25::LUK allele for all properties examined, 

1 5 including MMS sensitivity, slow growth, and the sporulation defect, indicating 

that wild-type HRR25 protein is associated with these processes and that the 
hrr25::LUK allele does not indirectly interfere with DNA repair, growth or 
sporulation. In direct parallel comparisons, the hrr25::LUK and hrr25* alleles 
behaved identically. 

20 Yeast strain MFH14 (MATalMAT* ura3/ura3) was transformed with 

Bg/ll-linearized YCp50-HW?25::LUK to uracil prototrophy, heterozygous 
disruption of HRR25 was verified by Southern blot analysis, the diploid was 
sporulated by starvation for nitrogen and fermentable carbon source, tetrads 
dissected and cells allowed to germinate at 30° C for 7 days. After a normal 

25 germination period of 2 days, the severe growth defect of hrr25:±UK 

suggested that the deletion of HRR25 was lethal. However, microscopic 
examination of segregants revealed that /7/t25::LUK germinating cells grew 
slowly and in every case examined (20/20 tetrads), slow growth, MMS 
sensitivity, and uracil prototrophy co-segregated. A color variation was seen 
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seen with diploid MFH14 segregants, due to mutations in adenine 
biosynthesis. MFH14 is ade5fADE5 ade2/ade2. An ade5/ade2 strain was 
white, while an ADE5/ade2 strain was red. 

EXAMPLE 3 

5 SEQUENCE AND STRUCTURE OF THE HRR2S GENE 

DNA sequencing of both strands of the HRR25 gene was done by uni- 
directional deletions employing Sequenase (USB, Cleveland, OH) and Exo- 
Meth (Stratagene, La Jolla, CA) procedures as described by the 

10 manufacturers. Figure 1 A, shows the location of the prolines and glutamines 

at the C-terminus as indicated by asterisks, and the limits of homology to 
protein kinase catalytic domains. Figure 1B shows a schematic 
representation of the structure of HRR25. The protein kinase homology is 
represented by a shaded region while the P/Q rich region is indicated by 

15 cross-hatchings. The mutant, hrr25, can be distinguished from HRR25 by 

one amino acid substitution. At position 151, an aspartic acid is substituted 
for glycine. 

The predicted translation product of HRR25 revealed an unexpected feature 
for a rad-like DNA repair function. HRR25 contains the hallmark signatures 

20 of sequence homology with the catalytic domain of serine/threonine protein 

kinase superfamily members (Hanks, et ai, Science, 241:42, 1988). For 
comparison, the HRR25 translation product was aligned with the catalytic 
domains for two subgroups of yeast protein kinases, the CDC28/cdc2 group 
and the KSS1/FUS3 group. Located between amino acids 15 and 30 is a 

25 region that contains the conserved GXGXXG region. Just C-terminal to this 

region is a conserved lysine and glutamic acid present in most known 
kinases. These regions are thought to function in the nucleotide binding 
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and phosphotransfer steps of the kinase reaction (Hanks, er a/., Science, 
241:42,1988). Between amino acid residues 120 to 150 are regions 
containing the HRD and DFG motifs, also found in most protein kinase 
family members. In addition, sequence examination of ail known 
5 serine/threonine kinases indicates that HRR25 shares some additional 

similarities with the RaflPKSImos subgroup (Hanks, et al., Science, 241:42. 
1988). The strongest homologies can be found in areas around the 
GXGXXG, DFG, and DXXSXG conserved regions in protein kinase catalytic 
domains. 

1 0 The functional relevance of the observed sequence similarity between HRR25 

and protein kinases was studied by altering specific residues within the 
HRR25 kinase domain and examining the phenotypic consequences of these 
changes. A lysine at position 38 (Lys *) was mutated to an arginine residue 
by site directed mutagenesis, by methods known in the art. The mutagenic 

15 oligonucleotide was: 

S'-CCTGATCGATTCCAGCCTGATCGCTACTTCTTCACCACT-S*. 

Lys 38 in HRR25 corresponds to the lysine found in all known protein kinases, 
and this subdomain is involved in ATP binding. Mutations at the conserved 
lysine in protein kinases such as v-src, v-mos, and DBF2 inactivate these 
20 proteins. The mutant fcntfS-Lys 38 allele was incapable of complementing 

hrr25-1, hrr25::LUK, and hrr25n alleles for all properties examined, an 
indication that the HRR25 kinase domain is required for in vivo function of 
HRR25. 

The predicted HRR25 translation product has a number of notable features 
25 outside the region of homology to protein kinase catalytic domains. For 

example, the last 100 amino acids is proline and glutamine rich, containing 
50 of these residues. Other proteins with regions rich in these two amino 
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acids include the transcription factors Sp1,jun, and HAP2, steroid hormone 
receptors, the S. pombe rani kinase, and mak-male germ cell-associated 
kinase (Courey, et aL, Cell, 5§:887, 1988; Bohmann, et aL, Science, 
222:1386, 1987; Roussou, et a/., MoLCelLBioL, 3:2132, 1988; Arriza, et aL, 
5 Science, 22Z:268, 1987; Matsushime, et aL, MoLCelLBioL, 1Q:2261, 1990). 

In the case of Sp1 and jun, the proline-glutamine regions are involved in 
transactivation, whereas the P/Q region in the human mineralocorticoid 
receptor is thought to serve as an intramolecular bridge. This proline- 
glutamine region in HRR25 might function as a structural feature for 

10 substrate interaction, or for subcellular localization. Also, the glutamine 

richness of this region is similar to the opa or M-repeat seen in the 
Drosophila and Xenopus Notch/Xotch proteins (Wharton, et aL, Cell, 40:55, 
1985; Coffman, ef aL, Science, 242:1438, 1990). The function of the opa 
repeat is not certain, but it is found in several Drosophila genes. Lastly, the 

15 sequence TKKQKY at the C-terminal end of the region homologous to 

protein kinases is similar to the nuclear localizing signal of SV40 large T 
antigen and yeast histone H2B (Silver, ef aL, J.CelLBioL, 103:983, 1989; 
Moreland, et aL, MoLCelLBioL, 7:4048, 1987). 

EXAMPLE 4 

20 MICROSCOPIC ANALYSIS OF GERMINATING AND 

PROLIFERATING hrr25 CELLS 

Photomicrographs of HRR25 and hrr25::LUK colonies were taken after 
germination on rich medium. An MFH14 hn25::LUK heterozygous 
transformant was dissected onto a thin film of YPD rich medium on a 
25 sterilized microscope slide and segregants were allowed to germinate under 

a coverslip by incubating the slide in a moist 30 °C chamber. Photographs 
of colonies were taken after 2 days of growth. Phase contrast and DAPI 
staining of proliferating HRR25& and hn25::LUK cells were compared. Cells 
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were inoculated into YPD nrii meciium and grown at 30° C to a mid-log 
density of 1-3 X 10 7 cells/ml, briefly sonicated to disrupt clumps, fixed with 
formaldehyde, and stained with DAPI (Williamson, et a/., Meth.Cell.Biol., 
12:335, 1975). Many cells with hn25::LUK lacked DAPI stainable nuclei. 

5 Microscopic examination of germinating and actively growing mid-log phase 

hrr25::LUK cells revealed aberrant cellular morphologies. Transposon 
disruption of HRR25 resulted in large cells, and 25-40% of cells were 
filamentous or extended. DAPI nuclear staining (Williamson, et a/., 
Meth.Cell.BioL, 12:335, 1975) of mid-log populations showed that orderly 

10 cell cycle progression in hrr25 mutants was lost. There were a large number 

of cells lacking DAPI-stainable nuclei which, by single cell manipulations 
proved to be inviable. Consistent with this nuclear segregation defect, the 
plating efficiency of hn25::LUK haploids was also reduced to 75-80% of wild 
type. However, this reduction in plating efficiency is insufficient to account 

15 for the severe growth rate reduction. Plating efficiency was measured from 

mid-log phase cells by comparing the efficiency of colony formation on rich 
medium relative to the total number of cells determined by hemocytometer 
count Cell populations were analyzed for DNA content distribution by flow 
cytometric analysis following staining with propidium iodide as described 

20 (Hutter, et al. J.Gen.MicrobioL, 113:369, 1 979). Cell sorting analysis showed 

that a large number of the cells in a haploid hn25::LUK population were 
delayed in the cell cycle and exhibited G2 DNA content, but the population 
was not arrested uniformly in the cell cycle. 
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EXAMPLE 5 

SEQUENCE COMPARISON OF HRR25 WITH 
CDC2& JC$ST f AND RAF1 

The predicted translation product of HRR25 was compared with the catalytic 
domains of several members of the serine/threonine protein kinase 
superfamily. Initial sequence comparisons employed the UWGCG programs 
(Devereux, et al., NucAcids.Res., 12:387, 1984), whereas subgroup 
comparisons used the methods of Hanks, et a/., supra, HRR25 contains all 
eleven subdomains described by Hanks, et aL, supra. Structurally similar 
groupings were compared in the sequence comparisons. These included 
nonpolar chain R groups, aromatic or ring-containing R groups, small R 
groups with near neutral polarity, acidic R groups, uncharged polar R 
groups, and basic polar R groups. 

CDC28 and KSS1 represent members of two subgroups of serine/threonine 
protein kinases in yeast. CDC28 is involved in cell cycle regulation while 
KSS1 acts in the regulation of the yeast mating pathway. HRR25 shows 
21% identity and 41% similarity to CDC28 and 19% identity and 43% 
similarity to KSS1. HRR25 shows highest similarity to members of the 
RaWPKS/Mos family of protein kinases. Through the catalytic domain. 
HRR25 shows 30% identity and 49% similarity to fla/7. 
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SUMMARY OF SEQUENCES 

Sequence I.D. No. 1 is the nucleic acid sequence (and the deduced amino 
acid sequence) of a genomic fragment encoding a yeast-derived tyrosine 
kinase of the present invention. 



5 Sequence I.D. No. 2 is the deduced amino acid sequence of a yeast-derived 

tyrosine of the present invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hoeksnra, Merl F. 
(ii) TITLE OF INVENTION: TYROSINE KINASE 
5 (iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Spensley Horn Jubas & Lubitz 

(B) STREET: 4225 Executive Square, Suite 1400 
10 (C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: US 

(B) FILING DATE: 03 -JUL- 1991 

(C) CLASSIFICATION: 
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(viii) ATTORNEY/ AG ENT INFORMATION: 

(A) NAME: Wecherell Ph.D., John R. 

(B) REGISTRATION NUMBER: 13,678 

(C) REFERENCE/ DOCKET NUMBER: PD-1318 

5 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 455-5100 

(B) TELEFAX: (619) 455-5110 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 3098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 (vii) IMMEDIATE SOURCE: 

(B) CLONE: Tyrosine Kinase 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 879.. 2364 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACTCGC CAATCACCAA GTTCTTATCC CACATCCCAC CAGTGTCTGA GTCATGGTTT 60 

ACCACCACCA TACCATCGCT GGTCATTTGT AAATCCGTTT CTATTACATC AGCACCTGCT 120 

GCATAAGCCT TCTCAAATGC TAGTAGCGTA TTTTCAGGAT ATCTTGCTTT AAAAGCTCTG 180 

5 TGGCCCACAA TTTCAACCAT CCTCGTGTCC TTGTTGTTAT CTTACACTTC TTATTTATCA 240 

ATAACACTAG TAACATCAAC AACACCAATT TTATATCTCC CTTAATTGTA TACTAAAAGA 300 

TCTAAACCAA TTCGGTATTG TCCTCGATAC GGCATGCGTA TAAAGAGATA TAATTAAAAG 360 

AGGTTATAGT CACGTGATGC AGATTACCCG CAACAGTACC ACAAAATGGA TACCATCTAA 420 

TTGCTATAAA AGGCTCCTAT ATACGAATAA CTACCACTGG ATCGACGATT ATTTCGTGGC 480 

10 AATCATATAC CACTGTGAAG AGTTACTGCA ACTCTCGCTT TGTTTCAACG CTTCTTCCCG 540 

TCTGTGTATT TACTACTAAT AGGCAGCCCA CGTTTGAATT TCTTTTTTTC TGGAGAATTT 600 

TTGGTGCAAC GAGGAAAAGG AGACGAAGAA AAAAAGTTGA AACACGACCA CATATATGGA 660 

ACGTGGTTGA AATACAAAGA GAAGAAAGGT TCGACACTCG AGGAAAGCAT TTGGTGGTGA 720 

AAACACATCT TAGTAGCATC TTTAAACCTC TGTTGGGTAC TTAGAAAAAT ATTTCCAGAC 780 

15 TTCAAGGATA AAAAAAGTCG AAAAGTTACG ACATATTCGA CCAAAAAAAA AAACCAAAAA 840 
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GAAAAGATAT ATTTATAGAA AGGATACATT AAAAAGAG ATG GAC TTA AGA GTA 893 

Met Asp Leu Arg Val 
1 5 

GGA AGG AAA TTT CGT ATT GGC AGG AAG ATT GGG AGT GGT TCC TTT GGT 941 
5 Gly Arg Lys Phe Arg He Gly Arg Lys He Gly Ser Gly Ser Phe Gly 

10 15 20 

GAC ATT TAC CAC GGC ACG AAC TTA ATT AGT GGT GAA GAA GTA GCC ATC 989 
Asp He Tyr His Gly Thr Asn Leu He Ser Gly Glu Glu Val Ala He 
25 30 35 

10 AAG CTG GAA TCG ATC AGG TCC AGA CAT CCT CAA TTG GAC TAT GAG TCC 1037 

Lys Leu Glu Ser He Arg Ser Arg His Pro Gin Leu Asp Tyr Glu Ser 
40 45 50 • 

CGC GTC TAC AGA TAC TTA AGC GGT GGT GTG GGA ATC CCG TTC ATC AGA 1085 
Arg Val Tyr Arg Tyr Leu Ser Gly Gly Val Gly He Pro Phe He Arg 
15 55 60 65 

TGG TTT GGC AGA GAG GGT GAA TAT AAT GCT ATG GTC ATC GAT CTT CTA 1133 
Trp Phe Gly Arg Glu Gly Glu Tyr Asn Ala Met Val He Asp Leu Leu 
70 75 80 85 

GGC CCA TCT TTG GAA GAT TTA TTC AAC TAC TGT CAC AGA AGG TTC TCC 1181 
20 Gly Pro Ser Leu Glu Asp Leu Phe Asn Tyr Cys His Arg Arg Phe Ser 

90 95 100 



TTT AAG ACG GTT ATC ATG CTG GCT TTG CAA ATG TTT TGC CGT ATT CAG 
Phe Lys Thr Val He Met Leu Ala Leu Gin Met Phe Cys Arg He Gin 
105 HO H5 



1229 
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20 



TAT ATA CAT GGA AGG TCG TTC ATT CAT AGA GAT ATC AAA CCA GAC AAC 
Tyr lie His Gly Arg Ser Phe He His Arg Asp He Lys Pro Asp Asa 
120 125 



1277 



130 



TTT TTA ATG GGG GTA GGA CGC CGT GGT AGC ACC GTT CAT GTT ATT GAT 
Phe Leu Mec Gly 7.1 Gly Arg Arg Gly Ser Thr Val His Val lie Asp 
135 

TTC GGT CTA TCA AAG AAA TAC CGA GAT TTC AAC ACA CAT CGT CAT ATT 

Phe Gly Leu Ser Lys Lys Tyr Arg Asp Phe Asn Thr His Arg His lie 

155 160 
150 155 

CCT TAC AGG GAG AAC AAG TCC TTG ACA GGT ACA GCT CGT TAT GCA ACT 
Pro Tyr Arg Glu Asn Lys Ser Leu Thr Gly Thr Ala Arg Tyr Ala Ser 
170 1" 180 

GTC AAT ACG CAT CTT GGA ATA GAG CAA ACT AGA AGA GAT GAC TTA GAA 
Val Asn Thr His Leu Gly He Glu Gin Ser Arg Arg Asp Asp Leu Glu 
185 150 1" 

TCA CTA GGT TAT GTC TTG ATC TAT TTT TGT AAG GGT TCT TTG CCA TGG 
Ser Leu Gly Tyr Val Leu lie Tyr Phe Cys Lys Gly Ser Leu Pro Trp 
200 205 210 

GAG GGT TTG AAA GCA ACC ACC AAC AAA CAA AAG TAT GAT CGT ATC ATG 
Gin Gly Leu Lys Ala Thr Thr Lys Lys Gin Lys Tyr Asp Arg He Met 
215 220 225 

CAA AAG AAA TTA AAC GTT AGC GTC GAA ACT CTA TGT TCA GCT TTA CCA 
G1 „ Lvs Lys U. Asn V.1 S« V.1 =lu mr U„ Cys U. Gly Leu so 

935 240 245 

230 



1325 



1373 



1421 



1469 



1517 



1565 



1613 
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TTA GAG TTT CAA GAA TAT ATG GCT TAC TGT AAG AAT TTG AAA TTC GAT 
Leu Glu Phe Gin Glu Tyr MeC Ala Tyr Cys Lys Asn Leu Lys Phe Asp 
250 255. 260 



1661 



GAG AAG CCA GAT TAT TTG TTC TTG GCA AGG CTG TTT AAA GAT CTG AGT 
Glu Lys Pro Asp Tyr Leu Phe Leu Ala Arg Leu Phe Lys Asp Leu Ser 
265 270 275 



1709 



ATT AAA CTA GAG TAT CAC AAC GAC CAC TTG TTC GAT TGG ACA ATG TTG 
lie Lys Leu Glu Tyr His Asn Asp His Leu Phe Asp Trp Thr Met Leu 
280 285 290 



1757 



10 CGT TAC ACA AAG GCG ATG GTG GAG AAG CAA AGG GAC CTC CTC ATC GAA 

Arg Tyr Thr Lys Ala Met Val Glu Lys Gin Arg Asp Leu Leu He Glu 
295 300 305 



1805 



15 



AAA GGT GAT TTG AAC GCA AAT AGC AAT GCA GCA AGT GCA AGT AAC AGC 
Lys Gly Asp Leu Asn Ala Asn Ser Asn Ala Ala Ser Ala Ser Asn Ser 
310 315 320 325 

ACA GAC AAC AAG TCT GAA ACT TTC AAC AAG ATT AAA CTG TTA GCC ATG 
Thr Asp Asn Lys Ser Glu Thr Phe Asn Lys He Lys Leu Leu Ala Met 
330 335 340 



1853 



1901 



AAG AAA TTC CCC ACC CAT TTC CAC TAT TAC AAG AAT GAA GAC AAA CAT 
20 Lys Lys Phe Pro Thr His Phe His Tyr Tyr Lys Asn Glu Asp Lys His 

345 350 355 



1949 



AAT CCT TCA CCA GAA GAG ATC AAA CAA CAA ACT ATC TTG AAT AAT AAT 
Asn Pro Ser Pro Glu Glu He Lys Gin Gin Thr He Leu Asn Asn Asn 
360 365 370 



1997 
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GCA GCC TCT TCT TTA CCA GAG GAA TTA TTG AAC GCA CTA GAT AAA GGT 2045 
Ala Ala Ser Ser Leu Pro Glu Glu Leu Leu Asn Ala Leu Asp Lys Gly 
375 380 385 

ATG GAA AAC TTG AGA CAA CAG CAG CCG CAG CAG CAG GTC CAA ACT TCG 2093 
Met Glu Asn Leu Arg Gin Gin Gin Pro Gin Gin Gin Val Gin Ser Ser 
390 395 ">0 405 

CAG CCA CAA CCA CAG CCC CAA CAG CTA CAG CAG CAA CCA AAT GGC CAA 2141 
Gin Pro Gin Pro Gin Pro Gin Gin Leu Gin Gin Gin Pro Asn Gly Gin 
410 415 420 

AGA CCA AAT TAT TAT CCT GAA CCG TTA CTA CAG CAG CAA CAA AGA GAT 
Arg Pro Asn Tyr Tyr Pro Glu Pro Leu Leu Gin Gin Gin Gin Arg Asp 
425 430 435 

TCT CAG GAG CAA CAG CAG CAA GTT CCG ATG GCT ACA ACC AGG GCT ACT 
Ser Gin Glu Gin Gin Gin Gin Val Pro Met Ala Thr Thr Arg Ala Thr 
440 445 450 

CAG TAT CCC CCA CAA ATA AAC AGC AAT AAT TTT AAT ACT AAT CAA GCA 2285 
Gin Tyr Pro Pro Gin lie Asn Ser Asn Asn Phe Asn Thr Asn Gin Ala 
455 460 465 



TCT GTA CCT CCA CAA ATG AGA TCT AAT CCA CAA CAG CCG CCT CAA GAT 
Ser Val Pro Pro Gin Met Arg Ser Asn Pro Gin Gin Pro Pro Gin Asp 
470 



475 480 485 



AAA CCA GCT GGC CAG TCA ATT TGG TTG TAA G CAACATATAT TGCTCAAAAC 
Lys Pro Ala Gly Gin Ser lie Trp Leu * 
490 495 



2189 



2237 



2333 



2384 
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GCACAAAAAT AAACATATGT ATATATAGAC ATACACACAC ACATATATAT ATATATATTA 2644 

TTATTATTAT TTACATATAC GTACACACAA TTCCATATCG AGTTAATATA TACAATTCTG 2504 

GCCTTCTTAC CTAAAAAGAT GATAGCTAAA AGAACCACTT TTTTTATGCA TTTTTTTCTT 2564 

CGGGAAGGAA ATTAAGGGGG AGCGGAGCAC CTCTTGGCCA ATTTGTTTTT TTTTTATGTA 2624 

5 ATAAAGGGCT AACGATCGAA GATCAATCAC GAATATTGGA CGGTTTTAAA GGAGGGCCTC 2684 

TGAGAAGACA GCATCAATTC GTATTTTCGA TAATTAACTT GCCTTATAGT GTCTGATTAG 2744 

GAAACAATCA CGAGACGATA ACGACGGAAT ACCAAGGAAG TTTGTGCAAA TATACAGCCG 2804 

GCACAAACAG CAGCTTCACT CAGGTTAACT CACATACTGT TGAAAATTGT CGGTATGGAA 2864 

TTCGTTGCAG AAAGGGCTCA GCCAGTTGGT CAAACAATCC AGCAGCAAAA TGTTAATACT 2924 

10 TACGGGCAAG GCGTCCTACA ACCGCATCAT GATTTACAGC AGCGACAACA ACAACAACAG 2984 

CAGCGTCAGC ATCAACAACT GCTGACGTCT CAGTTGCCCC AGAAATCTCT CGTATCCAAA 3044 

GGCAAATATA CACTACATGA CTTCCAGATT ATGAGAACGC TTGGTACTGG ATCC 3098 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 495 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



WO 93/01205 



34. 



PCI7US92/05565 



(ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Mec Asp Leu Arg Val Gly Arg Lys Phe Arg lie Gly Arg Lys lie Gly 
! 5 10 15 

5 Ser Gly Ser Phe Gly Asp He Tyr His Gly Thr Asn Leu lie Ser Gly 

20 25 30 



Val Ala lie Lys Leu Glu Ser He Arg Ser Arg His Pro Gin 
35 



Glu Glu 

40 *5 



Tyr Glu Ser Arg Val Tyr Arg Tyr Leu Ser Gly Gly Val Gly 

10 50 



Leu Asp 

55 60 



Phe lie Arg Trp Phe Gly Arg Glu Gly Glu Tyr Asn Ala MeC 



He Pro . 

80 

65 



70 75 



Val He Asp Leu Leu Gly Pro Ser Leu Glu Asp Leu Phe Asn Tyr Cys 
85 90 « 



15 His Arg Arg 

100 



Phe Ser Phe Lys Thr Val He Mec Leu Ala Leu Gin Mec 



105 H° 



Phe Cys Arg 
115 



He Gin Tyr He His Gly Arg Ser Phe He His Arg Asp 



120 125 



He Lys Pro Asp Asn Phe Leu MeC Gly Val Gly Arg Arg Gly Ser Thr 

n c 140 

20 130 i 35 
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Val His Val He Asp Phe Gly Leu Ser Lys Lys Tyr Arg Asp Phe Asn 
145 150 155 160 

Thr His Arg His He Pro Tyr Arg Glu Asn Lys Ser Leu Thr Gly Thr 
165 170 175 

5 Ala Arg Tyr Ala Ser Val Asn Thr His Leu Gly He Glu Gin Ser Arg 

180 185 190 

Arg Asp Asp Leu Glu Ser Leu Gly Tyr Val Leu He Tyr Phe Cys Lys 
195 200 205 

Gly Ser Leu Pro Trp Gin Gly Leu Lys Ala Thr Thr Lys Lys Gin Lys 
10 210 215 220 

Tyr Asp Arg He Met Glu Lys Lys Leu Asn Val Ser Val Glu Thr Leu 
225 230 235 240 

Cys Ser Gly Leu Pro Leu Glu Phe Gin Glu Tyr Met Ala Tyr Cys Lys 
245 250 255 

15 Asn Leu Lys Phe Asp Glu Lys Pro Asp Tyr Leu Phe Leu Ala Arg Leu 

260 265 270 

Phe Lys Asp Leu Ser He Lys Leu Glu Tyr His Asn Asp His Leu Phe 
275 280 285 

Asp Trp Thr Met Leu Arg Tyr Thr Lys Ala Met Val Glu Lys Gin Arg 
20 290 295 300 
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Asp Leu Leu lie Glu Lys Gly Asp Leu Asn Ala Asn Ser Asn Ala Ala 

315 320 



305 



310 



Ser Ala Ser Asn Ser Thr Asp Asn Lys Ser Glu Thr Phe Asn Lys lie 
325 330 335 

5 Lys Leu Leu Ala MeC Lys Lys Phe Pro Thr His Phe His Tyr Tyr Lys 

340 345 350 

Asn Glu Asp Lys His Asn Pro Ser Pro Glu Glu He Lys Gin Gin Thr 
355 360 365 
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lie Leu Asn Asn Asn Ala Ala Ser Ser Leu Pro Glu Glu Leu Leu Asn 
370 375 380 

Ala Leu Asp Lys Gly MeC Glu Asn Leu Arg Gin Gin Gin Pro Gin Gin 

395 400 



385 



390 



Gin Val Gin Ser Ser Gin Pro Gin Pro Gin Pro Gin Gin Leu Gin Gin 
405 ^° 415 



15 



Gin Pro Asn Gly Gin Arg Pro Asn Tyr Tyr Pro Glu Pro Leu Leu Gin 

425 «0 



420 



Gin 



Gin Gin Arg Asp Ser Gin Glu Gin Gin Gin Gin Val Pro MeC Ala 



435 



440 



445 



20 



Thr Thr Arg Ala Thr Gin Tyr Pro Pro Gin He Asn Ser Asn Asn Phe 

455 ^0 



450 
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Asn Thr Asn Gin Ala Ser Val Pro Pro Gin Met Arg Ser Asn Pro Gin 
465 470 475 480 

Gin Pro Pro Gin Asp Lys Pro Ala Gly Gin Ser He Trp Leu * 

485 490 495 • 
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The invention now being fully described, it will be apparent to one of 
ordinary skill in the art that various changes and modifications can be made 
without departing from the spirit or scope of the invention. 
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CLAIMS 

1 . A DNA sequence encoding a polypeptide with defective tyrosine kinase 
activity, wherein the polypeptide is characterized as: 

(a) promoting normal mitotic recombination; and 

(b) being essentially unable to repair a DNA double-strand break 
which occurs at the cleavage site: 

CAACAG 

GTTGTC 
t 

2. The DNA sequence of claim 1 , wherein the break is induced by an 
endonuclease. 

3. The DNA sequence of claim 2, wherein the endonuclease is HO. 

4. The DNA sequence of claim 1, selected from the group consisting of: 

(a) cDNA having a nucleotide sequence derived from the genomic 
coding region of the polypeptide; 

(b) DNA sequences encoding an amino acid sequence having at 
least about 35% homology in the protein kinase domain with the 
amino acid sequence encoded by the cDNA of (a); and 

(c) DNA sequences which are degenerate as a result of the genetic 
code with respect to the DNA sequences of (a) and (b). 
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5. The DNA sequence of claim 1 which is hrr25 gene. 

6. The DNA sequence of claim 1 , wherein the DNA sequence is derived 
from a yeast. 

7. A host cell containing the DNA sequence of claim 1. 

8. The host cell of claim 7 wherein the DNA sequence is introduced by 
transformation or transfection. 

9. A biologically functional plasmid or viral DNA vector comprising the 
DNA sequence of claim 1 . 

10. A functional polypeptide encoded by all or a portion of the DNA 
sequence of claim 1 . 

11. A polypeptide with defective tyrosine kinase activity, wherein the 
polypeptide is characterized as: 

(a) promoting normal mitotic recombination, and 

(b) being essentially unable to repair a DNA double-strand break 
including those which occur at the cleavage site: 



CAACAG 
GTTGTC 



12. The polypeptide of claim 11. wherein the break is induced by an 
endonuclease. 
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13. The polypeptide of claim 12, wherein the endonuclease is HO. 

14. An antibody to the polypeptide of claim 11. 

15. The antibody of claim 14, which is a monoclonal antibody. 

16. An antibody to the polypeptide of claim 10. 

17. The antibody of claim 16, which is a monoclonal antibody. 

18. A method of identifying DNA encoding functional polypeptide with 
tyrosine kinase activity capable of restoring DNA double-strand break 
repair activity in the host of claim 6, which comprises: 

(a) screening a library of DNA for sequences capable of producing 
the polypeptide; and 

(b) identifying DNA encoding polypeptide capable of restoring 
double-strand break activity. 

19. The method of claim 18. wherein the DNA is mammalian DNA. 

20. The method of claim 19, wherein the mammalian DNA is human DNA. 

21. The method of claim 18. wherein the screening detects structural 
similarities between members of the DNA library or expression 
products thereof and the screening means. 

22. The method of claim 21. wherein the screening means are based on 
nucleic acid structure or antigenic structure. 
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23 The method of claim 18. wherein the screening detects functional 
similarities between the expression products of the DNA library and the 
tyrosine kinase activity of the host. 

24. The method of claim 23. wherein the functional similarities are detected 
by complementation. 

25. The method of claim 24. wherein the complementation measures 
restoration of resistance to a DNA double-strand break. 

26. The method of claim 25. wherein the DNA double-strand break is 
induced by a radiomimetic alkylating agent. 

27. The method of claim 26. wherein the radiomimetic alkylating agent has 
methylmethane sulfonate activity. 

28. The method of claim 27. wherein the agent is methylmethane sulfonate. 

29. The method of claim 25. wherein the DNA double-strand break is 
induced by an endonuclease. 

30. The method of claim 29. wherein the endonuclease is HO. 

31 . The method of claim 25. wherein the DNA double-strand break is X-ray 
induced. 

32. The method of claim 18. wherein the host is a yeast. 

33. The method of claim 32. wherein the yeast is a member of the genus 
Saccharomyces. 
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34. The method of claim 33, wherein the yeast is Saccharomyces 
cerevisiae. 



35. An isolated DNA sequence encoding a polypeptide with tyrosine kinase 
activity, wherein the polypeptide is capable of restoring DNA double- 
strand break repair activity in the host of claim 6. 

36. The DNA sequence of claim 35, selected from the group consisting of: 

(a) cDNA having a nucleotide sequence derived from the genomic 
coding region of the polypeptide; 

(b) DNA sequences encoding an amino acid sequence having at 
least about 35% homology in the protein kinase domain with the 
amino acid sequence encoded by the cDNA of (a); and 

(c) DNA sequences which are degenerate as a result of the genetic 
code with respect to the DNA sequences of (a) and (b). 

37. The DNA sequence of claim 35, which is a mammalian DNA. 

38. The mammalian DNA sequence of claim 37, which is a human DNA. 

39. An isolated DNA sequence identified by the method of claim 18. 

40. An isolated polypeptide capable of restoring DNA double-strand break 
repair activity in the host of claim 6 or functional fragments thereof, 
wherein the polypeptide is free from other polypeptides with which it 
is associated in nature. 
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41. An isolated polypeptide encoded by all or a functional portion of the 
DNA sequence of claim 35. 

42. The polypeptide of claim 40, which is a mammalian polypeptide. 

43. The mammalian polypeptide of claim 42. which is a human 
polypeptide. 

44. An antibody to the polypeptide of claim 40. 

45. The antibody of claim 44, which is a monoclonal antibody. 

46. An antibody to the polypeptide of claim 41 . 

47. The antibody of claim 46, which is a monoclonal antibody. 

48. A method for identifying a composition which affects the activity of a 
mammalian polypeptide having tyrosine kinase activity, wherein the 
polypeptide is capable of restoring DNA double-strand break repair 
activity in the host of claim 6, the method comprising: 

5 (a) incubating components comprising the composition and the 

mammalian polypeptide in the presence of the host, wherein the 
incubating is carried out for a period of time and under 
conditions sufficient to allow the components to interact; and 

(b) measuring the change in tyrosine kinase activity caused by the 
10 system. 

49. The method of claim 48. wherein the change in step (b) correlates with 
decreased repair of DNA double-strand breaks. 
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50. The method of claim 49, wherein the DNA double-strand break is 
induced by a radiomimetic alkylating agent. 

51 . The method of claim 50, wherein the radiomimetic alkylating agent has 
methylmethane sulfonate activity. 

52. The method of claim 51 , wherein the agent is methylmethane sulfonate. 

53. The method of claim 49, wherein the DNA double-strand break is 
induced by an endonuclease. 

54. The method of claim 53, wherein the endonuclease is HO. 

55. The method of claim 49, wherein the DNA double-strand break is X-ray 
induced. 

56. The method of claim 48. wherein the affect of the molecule is to inhibit 
the polypeptide. 

57. The method of claim 48, wherein the mammalian polypeptide is a 
human polypeptide. 
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